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Abstract. In this paper we study quantum nondeterminism in multi- 
party communication. There are three (possibly) different types of non- 
determinism in quantum computation: i) strong, ii) weak with classical 
proofs, and iii) weak with quantum proofs. Here we focus on the first 
one. A strong quantum nondeterministic protocol accepts a correct input 
with positive probability, and rejects an incorrect input with probability 
1. In this work we relate strong quantum nondeterministic multiparty 
communication complexity to the rank of the communication tensor in 
the Number-On-Forehead and Number-In-Hand models. In particular, 
by extending the definition proposed by de Wolf to nondeterministic 
tensor-rank (nrank), we show that for any boolean function /, 1) in the 
Number-On- Forehead model, the cost is upper-bounded by the logarithm 
of nrank(f); 2) in the Number-In-Hand model, the cost is lower-bounded 
by the logarithm of nrank(f). This naturally generalizes previous results 
in the field and relates for the first time the concept of (high-order) ten- 
sor rank to quantum communication. Furthermore, we show that strong 
quantum nondeterminism can be exponentially stronger than classical 
multiparty nondeterministic communication. We do so by applying our 
results to the matrix multiplication problem. 

Keywords: multiparty communication, quantum nondeterminism, ten- 
sor rank, exponential separation, matrix multiplication 

1 Introduction 

Background Nondeterminism plays a fundamental role in complexity theory. 
For instance, the P vs NP problem asks if nondeterministic time is strictly 
more powerful than deterministic time. Even though nondeterministic models 
are unrealistic, they can give insights into the power and limitations of realistic 
models (i.e., deterministic, random, etc.). 

There are two ways of defining a nondeterministic machine, using random- 
ness or as a proof system: a nondeterministic machine i) accepts a correct input 



with positive probability, and rejects an incorrect input with probability one; or 
ii) is a deterministic machine that receives besides the input, a proof or certifi- 
cate which exists if and only if the input is correct. For classical machines (i.e., 
machines based on classical mechanics), these two notions of nondeterminism are 
equivalent. However, in the quantum setting they can be different. In fact, these 
two notions give rise to (possibly) three different kinds of quantum nondetermin- 
ism. In strong quantum nondeterminism, the quantum machine accepts a correct 
input with positive probability. In weak quantum nondeterminism, the quantum 
machine outputs the correct answer when supplied with a correct proof, which 
could be either classical or quantum. 

The study of quantum nondeterminism in the context of query and commu- 
nication complexities started with de Wolf [TB]- In particular, de Wolf [TB] intro- 
duced the notion of nondeterministic rank of a matrix, which was later proved to 
completely characterize strong quantum nondeterministic communication |17j . 
In the same piece of work, it was proved that strong quantum nondeterministic 
protocols are exponentially stronger than classical nondeterministic protocols. 
In the same spirit, Le Gall |S] studied weak quantum nondeterministic com- 
munication with classical proofs and showed a quadratic separation for a total 
function. 

Weak nondeterminism seems a more suitable definition, mainly due to the 
requirement of the existence of a proof, a concept that plays fundamental roles 
in complexity theory. In contrast, strong nondeterminism lends itself to a natu- 
ral mathematical description in terms of matrix rank. Moreover, strong nonde- 
terminism is a more powerful model capable of simulating weak nondetermin- 
ism with classical and quantum proofs. The reverse, if weak nondeterminism is 
strictly a less powerful model or not is still an open problem. 

The previous results by de Wolf [17] and Le Gall [9] were on the context of 
2-party communication complexity, i.e., there are two players with two inputs 
x and y each, and they want to compute a function f(x,y). Let rank(f) be 
the rank of the communication matrix Mf, where Mf[x,y] = f[x,y). A known 
result is | log rank(f) < Q(f) < D(f) [2J, where D(f) is the deterministic 
communication complexity of / and Q(f) the quantum exact communication 
complexitjQ. It is conjectured that D(f) = 0(log c rank) for some arbitrary 
constant c. This is the log-rank conjecture in communication complexity, one 
the biggest open problems in the field. If it holds, implies that Q(f) and D(f) 
are polynomially related. This is in contrast to the characterization given by de 
Wolf [17 in terms of the nondeterministic matrix-rank, which is defined as the 
minimal rank of a matrix (over the complex field) whose (x, y)-entry is non-zero 
if and only if f(x,y) = 1. 

Contributions In this paper, we continue with the study of strong quantum 
nondeterminism in the context of multiparty protocols. Let k > 3 be the num- 
ber of players evaluating a function /(xi, . . . , Xfc). The players take turns pre- 
defined at the beginning of the protocol. Each time a player sends a bit (or 



4 All logarithms in this paper are base 2. 



qubit if it is a quantum protocol), he sends it to the player who follows next. 
The communication complexity of the protocol is defined as the minimum num- 
ber of bits that need to be transmitted by the players in order to compute 
f{x\, . . . , x k ). There are two common ways of communication: The Number-On- 
Forehead model (NOF) , where player i knows all inputs except Xi ; and, Number- 
In-Hand model (NIH), where player i only knows X{. Also, any protocol naturally 
defines a communication tensor Tf, where Tf[xi, . . . , x k ] = f{x\, . . . , x k ). 

Tensors are natural generalizations of matrices. They are defined as multi- 
dimensional arrays while matrices are 2-dimensional arrays. In the same way, 
the concept of matrix rank extends to tensor rank. However, the nice properties 
of matrix rank do not hold anymore for tensors; for instance, the rank could be 
different if the same tensor is defined over different fields [5]. 

We extend the concept of nondeterministic matrices to nondeterministic ten- 
sors. The nondeterministic tensor rank, denoted nrank(f), is the minimal rank 
of a tensor (over the complex field) whose (x%, . . . , Xfc)-entry is non-zero if and 
only if /(xi, ...,x k ) = 1. 

Let NQ^ OF and NQ^ IH denote the fc-party strong quantum nondetermin- 
istic communication complexity for the NOF and NIH models respectively. 

Theorem 1. Let f : ({0, l}") fc -> {0, 1}, then NQ^ OF (f) < \\og nrank(f)~\ + 
I, and NQ^ IH {f) > flog nrank(f)] + 1. 

This theorem generalizes the previous result by de Wolf, as it can be seen 
that by letting k = 2 we obtain exactly [17[ Lemma 3.21. Also, since NQ^ IH 



is a lower bound for exact NIH quantum communication^!, denoted Q^ IH , we 
obtain the following corollary: 



One of the first direct consequences of Theorem[T]is on the equality function. 
The A;-party equality function EQ k {x\, . . . , x k ) = 1 if and only if x\ = ■ ■ ■ = 
Xk- A nondeterministic tensor for EQ k is superdiagonal with non-zero entries 
in the main diagonal, and anywhere else. Thus, it has 2™ rank, and implies 
NQ^ OF (EQ k ) < n + 1 and NQ^ IH (EQ k ) > n + 1. However, note that the 
communication complexity of EQ k is upper-bounded by 0(n) in the NOF model, 
however this could be a very loose bound. In general, NQ^ OF cannot be lower- 
bounded by \ognrank. To see this, it is easy to show that in the NOF model 
there exists a classical protocol for EQ k with a cost of 2 bitfl In contrast, the 
lower bound on NQ^ IH (EQ k ) is not that loose; using the trivial protocol, where 
all players send their inputs, we have that NQ^ IH (EQ k ) — O(kn). 

5 An exact quantum protocol accepts a correct input and rejects an incorrect input 
with probability 1. 

6 Let the first player check if X2, . . . , x k are equal. If they are, he sends a 1 bit to the 
second player, who will check if x\, X3, . . . , x k are equal. If his strings are equal and 
he received a 1 bit from the first player, he sends a 1 bit to all players indicating 
that all strings are equal [3 Example 6.3]. 




Corollary 2. [log nrank(f)] + I < Q k IH {f)- 



A more interesting function is the generalized inner product defined formally 
as GIP k {x x ,. . .,x k ) = (£*Li A" = i Xij) mod 2. We know that (2™ - l)fc/2 < 
nrank(GIP k ) (see Appendix [£] for a proof), and thus, NQ^ IH (GIP k ) > n + 
|~log(fc/2)] + 1. In NIH, using the trivial protocol where each player send their 
inputs, we obtain (with Corollary [2]) a bound in quantum exact communication 
of [log(fc/2)l +n + l < Q^ IH {GIP k ) < (fc-l)n+l. Improving the lower bound 
will require new techniques for explicit construction of linear-rank tensors, with 
important consequences to circuit lower bounds [TS] (see for example the paper 
by Alexeev, Forbes, and Tsimerman [T] for state-of-the-art tensor constructions). 
In general, we are still unable to upper-bound NQ^ IH (/) in terms of log nrank. 

Although the bounds given by Theorem [T] could be loose for some functions, 
they are good enough for other applications. For instance, we show in Section 
U] a separation between the NOF models of strong quantum nondeterminism 
and classical nondeterminism. We do so by applying Theorem [T] to the matrix 
multiplication problem. This separation is super-polynomial when k = o(logn), 
and exponential when k = 0(1). To our knowledge, this is the first exponential 
quantum-classical separation for a total function in any multiparty communica- 
tion modeQ- 

2 Preliminaries 

In this paper we assume basic knowledge of communication complexity and 
quantum computing. We refer the interested reader to the books by Kushilevitz 
and Nisan [7] and Nielsen and Chuang [12] respectively. In this section we give 
a small review of tensors and quantum communication. 

2.1 Tensors 

A tensor is a multi-dimensional array defined over some field. An order-d tensor 
is an element of the tensor product of d vector spaces. 

Definition 3 (Simple Tensor). Let \vi) g V n ' be an ni- dimensional vector 
for 1 < i < d on some vector space V ni . The j\ component of \vi) is de- 
noted by Vi(ji) for 1 < j% < n^. The tensor product of {\vi)} is the tensor T e 
V ni ® •• • (g> V nd whose (ji, . . . ,jd)-entry is «i(ji) • • • v d (jd), i.e., T\j\, . . . , j d ] = 
Vi(ji) ■ • ■ Vd{jd). Then T = \v\) (g) • • ■ ® \vd) and we say T is a rank-1 or simple 
order-d tensor. We also say that a tensor is of high order if its order is three or 
higher. 

From now on, we will refer to high-order tensors simply as tensors, and low- 
order tensor will be matrices, vectors, and scalars as usual. 

7 A previous separation, super-polynomial when k = o{ \/\og n / log log n) and expo- 
nential when k — 0(1), was found by Gavinsky and Pudlak [3] for a relational 
communication problem in the simultaneous message passing model. 



It is important to note that the set of simple tensors span the space V ni <g> 
• • • (g) V nd , and hence, there exists tensors that are not simple. This leads to the 
definition of rank. 

Definition 4 (Tensor Rank). The rank of a tensor T is the minimum r such 
that T — f or simple tensors A4. 

This agrees with the definition of matrix rank. The complexity of computing 
tensor rank was studied by Hastad [4] who showed that it is NP-complete for 
any finite field, and NP-hard for the rational numbers. 

The process of arranging the elements of an order-fc tensor into a matrix is 
known as matrization. Since there are many ways of embedding a tensor into a 
matrix, in general the permutation of columns is not important, as long as the 
corresponding operations remain consistent [6]. 

2.2 Strong Quantum Nondeterministic Multiparty Communication 

In a multiparty communication protocol there are k > 3 players trying to 
compute a function /. Let / : X k — > {0, 1} be a function on k strings x = 
{x\, . . . , Xk), where each Xi £ X and X = {0, 1}™. There are two common ways 
of communication between the players: The Number-In-Hand (NIH) and the 
Number-On- Forehead (NOF) models. In NIH, player i only knows Xi, and in 
NOF, player i knows all inputs except Xi. First we review the classical defintion. 

Definition 5 (Classical nondeterministic multiparty protocol). Let k be 

the number of players. Besides the input x, the protocol receives a proof or certifi- 
cate c S {0, The players take turns in an order predefined at the beginning 
of the protocol. To communicate, a player sends exactly one bit to the player that 
follows next. The computation of the protocol ends when the last player computes 
f . If f(x) = 1 then, there exists a c that makes the protocol accept the input, 
i.e., the last player outputs 1. If f{x) = then, the protocol rejects the input for 
all c, i.e., the last player outputs 0. The cost of the protocol is the length of c 
plus the total number of bits communicated. 

Hence, the classical nondeterministic multiparty communication complexity, 
denoted Nk(f), is defined as the minimum number of bits required to compute 
f{x). If the model is NIH or NOF, we add a superscript N^ IH (f) or N^ OF (f) 
respectively. Note that, the definition of the multiparty protocols in this paper 
(classical and quantum) are all unicast, i.e., a player sends a bit only to the player 
that follows next. This is in contrast to the more common blackboard model. In 
this latter model, when a player sends a bit, he does so by broadcasting it and 
reaching all players inmediately. Clearly, any lower bound on the blackboard 
model is a lower bound for the unicast model. 

To model NOF and NIH in the quantum setting, we follow the work of Lee, 
Schechtman, and Shraibman [10], as originally defined by Kerenidis [5]. 



Definition 6 (Quantum multiparty protocol). Let k be the number of play- 
ers in the protocol. Define the Hilbert space by "Hi <8> • • • ® Hk ®C, where each Hi 
is the Hilbert space of player i, and C is the one qubit channel. To communicate 
the players take turns predefined at the beginning of the protocol. On the turn of 
player i: 

1. in NIH, an arbitrary unitary that only depends on Xi is applied on %i ® C , 
and acts as the identity anywhere else; 

2. in NOF, an arbitrary unitary independent of Xi is applied on Hi ® C , and 
acts as the identity anywhere else. 

The cost of the protocol is the number of rounds. 

If there is no entanglement, the initial state is a pure state 0} £g> ■ • • £g> |0)|0). 
In general, the initial state could be anything that is independent of the input 
with no prior entanglement. If the final state of the protocol on input X\, . . . ,Xk 
is \tp), it outputs 1 with probability p(xi, Xk) = (V'l-^ilV'): where III is a 
projection onto the |1) state of the channel. 

We say that T is a nondeterministic communication tensor if T[xi, ... , Xk] ^ 
if and only if f(xx, . . . , X}~) — 1. Thus, T can be obtained by replacing each 
1-entry in the original communication tensor by a non-zero complex number. 
We also define the nondeterministic rank of /, denoted nrank(f), to be the 
minimum rank over the complex field among all nondeterministic tensors for /. 

Definition 7 (Strong Quantum Nondeterministic Protocol). A k-party 
strong quantum nondeterministic communication protocol outputs 1 with positive 
probability if and only if f(x) = 1. 

The fc-party quantum nondeterministic communication complexity, denoted 
NQk(f), is the cost of an optimum (i.e., minimal cost) fc-party quantum non- 
deterministic communication protocol. If the model is NIH or NOF, we add a 
superscript NQ^ IH (/) or NQ^ OF '(/) respectively. From the definition it follows 
that NQk is a lower bound for the exact quantum communication complexity 
Q k for both NOF and NIH. 

Lemma 8 (Lee, Schechtman, and Shraibman |10j). After £ qubits of com- 
munication on input (xi, . . . , Xk), the state of a quantum protocol without shared 
entanglement can be written as 

\A l m (x 1 ))\Al(x 2 ))---\A k m (x k ))\me), 

m6{0,l} f 

where m is the message sent so far, mi is the t-th bit in the message, and each 
vector \A t m (x t )) corresponds to the t-th player which depends on m and the input 
x l . If the protocol is NOF then x = (xi, . . . , Xt—i, Xt+\, ■ ■ ■ , Xk); if it is NIH then 
x l = (x t ). 



3 Proof of Theorem Q] 



The arguments in this section are generalizations of a previous result by de Wolf 
|17j from 2-party to fc-party communication. 

First we need the following technical lemma. It is a generalization of [T71 
Lemma 3.2] from k = 2 to any k > 3. See below for a proof. 

Lemma 9. If there exists k families of vectors {\A\(xi)) , . . . ,\A l r (xi))} C C d 
for all i with 2 < i < k and Xi S {0, 1}™ such that 

r 

\A\ixx)) (g> ••• <g) |A fc (^fc)) =0 i/ and on/y z//(xi, . . . ,x fc ) = 0, 

i=i 

then nrank(f) < r. 

Now we proceed to prove the lower bound in Theorem [TJ 
Lemma 10. NQ^ IH (f) > [log nrankifj] + 1 

Proof. Consider a NIH £-qubit protocol for /. By Lemma |8] its final state is 

M= \A l m {x l ))---\A k m {x k ))\rm). (1) 

me{0,l}' 

Assume all vectors have the same dimension d. Let S = {m 6 {0, l} 1 : mi = 1}, 
and consider only the part of the state that is projected onto the 1 state of the 
channel, 

Mx ll ...,x k ))=Y l \A) n {x 1 ))---\A k m {x k ))\l). (2) 

m£S 

The vector \(j>(xi, . . . , Xk)) is if and only if f(xi, . . . , x k ) — 0. Thus, by 
Lemma [9J we have that nrank(f) < \S\ = 2 e_1 , which implies the lower bound. 

□ 



Proof (Lemma G|). First note that the case k=2 was proven by de Wolf [17l 
Lemma 3.2]. Here we give a proof for k > 3. We divide it in two cases: when k 
is odd and even. 

Even k: There are k size-r families of <i-dimensional vectors. We will construct 
two new families of vectors denoted @ and & '. First, divide the k families in 
two groups of size fc/2. Then, tensor each family in one group together in the 
following way: for each family {\A\(xi)), . . . , \A l r (xi))} for 1 < i < fc/2 construct 
a new family 

!k/2 fc/2 1 

(g) \A{(xi)), . . . ,(g) \Ai(xi)) \ = jl^O/)), . . . , \A r (y))j, 



where y = (xx, . . . , x k / 2 )- Do the same to construct & for k/2 + 1 < i < k 
obtaining 



■^={ |Ai(»,)>,..., = ^|B 1 («)),...,|B P ( 2 r)) 

i=fe/2+l i=fc/2+l 

where z = (2^/2+1, • ■ • Thus, 3$ and J?" will become two size-r family of 
vectors, each vector with dimension dk/2. Then apply the theorem for k = 2 on 
these two families and the lemma follows. 

Odd k: Here we can use the same approach by constructing again two new 
families @ and & by dividing the families in two groups of size [k/2\ and \k/2~\ . 
However, although both families will have the same size r, the dimension of 
the vectors will be different. In fact, the dimension of the vectors in one family 
will be a" = d[k/2\ and in the other d' + 1. So, in order to prove the theorem 
we will consider having two size-r families {\A\(y)), . . . ,\A r (y))} C C d and 
{\B 1 (z)),...,\B r (z))}CC d '+\ 

Denote the entry of each vector \ Ai(y)), \Bi(z)) by Ai(y) u and Bi(z) v respec- 
tively for all (u, v) £ [d 1 ] x [d 1 + 1]. 

Note that, if f(y,z) = then 521= 1 A-i(y) u Bi(z) v = for all (u,v); if 
f(y,z) = 1 then J2l =1 Ai(y) u Bi(z) v ^ for some (u, v). This holds because 
each vector \Ai(y)) and \Bi{z)) are the set of vectors |A*(a;*)) tensored together 
and separated in two families of size \_k/2\ and \k/2~\ respectively. 

The following lemma was implicitly proved by de Wolf |17j for families of 
vectors with the same dimension. However, we show that the same arguments 
hold even if the families have different dimensionality (see Appendix [Bj . 

Lemma 11. Let I be an arbitrary set of real numbers of size 2 2n+1 , and let 
ai, ■ • ■ > Old' an d Pii ■ • ■ 1 fid'+i be numbers from I . Define the quantities 

d' d' + l 

a i(v) = aTld M Z ) = X! PvBi(z) v . 

u—1 v—1 

Also let 

r d' d' + l / r \ 

v{y,z) = ^2ai(y)bi(z) = X] au @ v ( A ^y)uB l (z) v J . 

i—1 u—1 v—1 \i=l / 

There exists ax, . . . ,ad',j3x, ■ ■ ■ ,Pd'+i £ I such that for every (y,z) G / (1) we 
have v(y, z) 7^ 0. 

Therefore, by the lemma above we have that v(y, z) = if and only if 
f(y,z) = 0. Now let |<Zj) and \bi) be 2™-dimensional vectors indexed by ele- 
ments from {0,1}™, and let M = 521=1 a i)(^l- Thus M is an order-fc tensor 
with rank r. □ 



Lemma 12. NQ* OF {f) < [log nrank(f)] + 1. 



The proof of Lemma [T2] follows by fixing a proper matrization (separating 
the cases of odd and even k) of the communication tensor, and then applying 
the 2-party protocol by de Wolf [T7] (see Appendix [Bj . 

4 A Quantum-Classical Super-polynomial Separation 

In this section, we show that there exists a function with a super-polynomial gap 
between classical and quantum NOF models of quantum strong nondctcrminism. 

Theorem 13. There is a super- polynomial gap between and NQ^ OF 

when k = o(logn), and exponential when k = 0(1). 

In particular, we analyze the following total function: Let X\ = ■ ■ ■ = Xk = 
{0, l} nx ™ be the set of all n x n boolean matrices. Also let X{ G Xi be a n x n 
boolean matrix, and denote by XiXj the multiplication of matrices Xi and Xj 
over the binary field. Define 

F(xi,...,Xk) = (xix 2 ■ ■ -Xk)n, 

i.e., F(xi, . . . , Xk) is the entry in the first row and first column in x\ • ■ ■ Xk- 

This matrix multiplication function was studied by Raz j!4j . who showed 
a fi(n/2 h ) lower bound in the blackboard model of NOF bounded-error com- 
munication. However, this lower bound also holds for the classical blackboard 
nondeterministic NOF communication denoted N k NOF (F). The reason is that 
the proof by Raz is based on an upper bound for discrepancy. Since N^ OF (f ) = 
f2(l/Disc(f)) for any / where Disc(f) is the discrepancy [11) . we immediately 
obtain the following corollary: 

Corollary 14. N^ OF (F) = !7(n/2 k ). 

The condition on the number of players in Theorem [T3] comes from this 
lower bound. Improving it will require new techniques for classical multiparty 
communication. 

Since any lower bound in the blackboard model also holds in the unicast 
model, we can use Corollary [14] to prove a separation for the unicast models in 
this paper. The following lemma implies the theorem. 

Lemma 15. NQ% OF (F) = O(fclogn). 

Proof. By Theorem [T] we just need to give a tensor with rank at most 0(n k ). 
Denote each entry of the matrix Xi by Xi[p, q], i.e., the (p, g)-entry of Xi. Also, 
all the operations in this proof are assumed to be over the binary field. 
Let 

T[xi,...,Xk] = (xi ■••x fc )ii, 



which is just the function F plugged into T. 



First, note that the multiplication is between n x n matrices. Hence, the 
maximum rank of the product is at most n. Therefore, we can write each entry 
of T as 



T[ Xl ,...,x k }= XX 1 ■• E4M = E (4 1 ---4 k )n. (3) 
\\h=i J \n=i J J n ii,..,A=i 

The notation a;^ can be interpreted as the j term in the rank decomposition 
of matrix Xi. Now fix j\, . . . ,j k , and by the definition of matrix multiplication 
we get that 

n 

(zf---4 fc )ii = E ^[l^i]^^.^---^^-!,!]- (4) 

ii,...,i fc _i=l 

Equations © and (0} have n fc and terms. Putting them both together, we 
have that T[x\, . . . ,Xk] have n 2k ~ 1 summands. This already have 0(n k ) terms; 
however, we need to make sure that each term in the summation defines a rank-1 
tensor. 

For each m £ { 1 , . . . , n k } define 

T m [xt, . . . , x k ] = xf [1, [i ls i 2 ] • • -x J k k [i k ~i, 1], (5) 

for some ji, . . . ,jk, «i, • • • , Xfc_i that directly corresponds to m (fix some bijection 
between m and ji, . . . ,jf., i\, ■ • ■ , Xk—i)- Then, let y%, . . . , y n xn 

e {0, l} nxn be 

an enumeration of all n x n boolean matrices. For instance, y\ is the all-0 matrix, 
and is the all-1 matrix. Define vectors 

|«i> = (y?[l,i 1 ],...,yil xn [l,ii]) and \v k ) = (yf [t k _i, 1], . . . ,y£* n [i k -u 1]) ; 
and for r — 2, . . . , k — 1 define 

Note that each vector has 2™ xn components, and are indexed by the set ofnxn 
boolean matrices. If we pick k matrices y^ , . . . , yi k , we get that 

T m [yi 1 ,...,Vi k ] = y{l[l,ii]...,yl*[ik-i,i\- (6) 
This way, T m = \vi) <g> V2} <8> • • ■ <S> |«fc) for all m. Thus, T m has rank 1, and 

«2k — 1 

To see that T m is indeed a rank-1 tensor, assume that rank(T m ) > 1. Then ^ 
has at least one extra summand. That extra summand can only come from (0} or 
((3|) . It cannot be from Q because that is the definition of matrix multiplication. 
If it were from Q , it would violate the assumption that each matrix xi has rank 
at most n, thus, yielding a contradiction. □ 



5 Concluding Remarks 



In this paper we studied strong quantum nondeterministic communication com- 
plexity in multiparty protocols. In particular, we showed that i) strong quantum 
nondeterministic NOF communication complexity is upper-bounded by the log- 
arithm of the rank of the nondeterministic communication tensor; ii) strong 
quantum nondeterministic NIH communication complexity is lower-bounded by 
the logarithm of the rank of the nondeterministic communication tensor. These 
results naturally generalizes previous work by de Wolf [T7]. Moreover, the lower 
bound on NIH is also a lower bound for quantum exact NIH communication. 
This fact was used to show a Q(n + log k) lower bound for the generalized inner 
product function. 

We also showed an exponential separation between quantum strong nonde- 
terministic communication and classical nondeterministic communication in the 
NOF model. To our knowledge, this is the first separation for a total function 
in any multiparty model. It remains as an open problem, a separation (of any 
kind) between other multiparty models, e.g., bounded-error, NIH, etc. 

In order to prove strong lower bounds using tensor-rank in NIH, we need 
stronger construction techniques for tensors. The fact that computing tensor- 
rank is NP-complete suggests that this could be a very difficult task. Alternatives 
for finding lower bounds on tensor-rank include computing the norm of the 
communication tensor, or a hardness result for approximating tensor-rank. 
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A Rank Lower Bound on GIP^ 

Lemma 16. nrank(GIP k ) > (2" - l)k/2. 

Proof. First, we start by generalizing the concept of rows and columns for ten- 
sors. Define a fiber to be a vector obtained by fixing every index except by one. 
Thus, a matrix column is a mode-1 fiber, and a row is a mode-2 fiber. For order-3 
tensors, we have columns, rows and tubes, and so on for higher order tensors. 
In general, a mode-i fiber is a vector obtained by fixing every but except the i th 
index. In the same way we define a slice to be a two-dimensional section of T 
obtained by fixing all but two indices. 

Here we will consider a particular form of matrization. Let T e c niX "' x ™* 
be an order-fc tensor, with m = 2" for every i. The i-mode unfolding of T, de- 
noted T(j), is the matrix obtained by arranging the i-mode fibers as columns. 
The permutations of the columns of is not important, as long as the corre- 
sponding operations remain consistent [6] . Define the i-rank of T as ranki (T) = 
rank(T(ij). It is trivial that ranki(T) < rank(T) for every i [8]. 

Now we proceed with the proof. Let T be the order-fc communication tensor 
for GIPk- Let Mjp n be the communication matrix for GIP2, i.e., the 2-party 
inner product function on n bits. It is well known that rank(Mip n ) = 2™ — 1 
(cf. [3 Example 1.29]). 

Fix the x' 3 , . . . , x' k inputs to be the all-1 strings and consider the (x' 3 , . . . , x' k )- 
slice of T denoted T x < ... x / . Then rank(T x i ,„ x < ) = rank(Mjp n ) = 2™ — 1, because 



by fixing xs, ...,Xk to all Is, the entries of T become (a;i|x2) for all x±,x 2 G 
{0,1}". 

Let denote the string x with the i th bit flipped. For i = 3, . . . , k consider 



the slice T , , , w of T. Then 



, <(i)[xi,X 2 ] = (Xi\x 2 ) -X U X2i, 



x 3--- x k — l X k 



where the non-zero entries agrees with the non-zero entries of Mjp n _ 1 by deleting 
the i th bits of x 1 and x 2 . Thus, rank(T , , , w ) = (2 n - l)/2. 

^3" '^fc— l^fc 

The 1-mode unfolding of T is obtained by fixing every index except x%. Thus 



T x ' 3 ...x' k T x , x i(3) ■ ■ ■ 



with 2^ k 1 ) n columns. We known that T x < and T , tu) for each i = 3, . . . , k 

3 " ' k X g . . . X 

have (2™ — 1) and (2" — l)/2 linearly independent columns respectively. Also, 
each of these columns are pair- wise linearly independent. Thus, rank\{T) > 
(2™ - l)fc/2, which implies rank(T) > (2™ - l)fc/2. □ 



B Proofs of Technical Lemmas 
B.l Proof of Lemma 1111 

If f(y, z) = then v(y, z) = for all a u , (3 V . If f(y, z) ^ there exists (u 1 , v') such 
that v(y, z) 7^ 0. Here we use the same arguments given by de Wolf |17j . i.e., we 
show that v(y, z) = happens with small probability. In fact, having families 
of vectors with different dimensions does not affect the argument. Consider the 
situation where all a u and f3 v were chosen except a u > and j3 v i. Write v(y,z) in 
terms of these two coefficients 

v(y, z) = c Q a u i/3 v , + cia u > + c 2 /3 v > + c 3 , 

where Co = 52i= 1 Ai{y)u' Bi(z)v' 7^ 0. If we fix a u > then, v(y,z) is a linear 
equation with at most one solution (zero). Therefore, we have at most 2 2n+1 • 1 
ways of choosing a u i and fi v i such that v(y,z) = 0. Thus 

22n+l 2 2 "+ 2 

Pr[v(y,z)=0] < < 



(2 2 "+ 1 ) 2 (2 2n +!) 2 
By the union bound 

Pr[3( 2/ ,z)e/- 1 (l)s.t. w (y,z) = 0]< £ Pr[«( Vl z) = 0] < 2 2n -2- 2 " = 1. 

(x.^eZ-Hi) 



The following is a probabilistic method argument. Since the above probability is 
strictly less than 1, there exists with positive probability sets {ai(y), ... ,a r (y)} 
and {bi(z), . . . ,b r (z)} such that for every (x,y) e / _1 (1) we have v(y,z) ^ 0. 



B.2 Proof of Lemma [12] 



Let T be a nondctcrministic tensor for / with nrank(f) — r. We divide the 
proof in two cases. 

Even k: Fix two players, say Pi (Alice) and P k (Bob). Also fix some matrization 
of T, i.e., let M be such matrization and consider it as an operator M : H k /2+i ® 
■■■ ®H k -> Hi® ■■■ ® H k /2- Thus M is a 2 fe "/ 2 x 2 fe "/ 2 -matrix that maps 
elements from the Hk/2+1 ® • • • ® Hk subspace to the Hi®---® Hk/2 subspace. 
Let also M = USV be the singular value decomposition of M such that U, V 
are 2 kn / 2 x 2 fe "/ 2 unitary matrices, and S is a 2 fe ™/ 2 x 2 fen / 2 diagonal matrix 
containing the singular values of M in the diagonal. The number of singular 
values is at most rank(M) < r. 

Bob computes the state \4>i--- k /2) — c i---k/2^V\xi, ■ ■ ■ , Xfc/2) where Ci,,, k i% is 
some normalizing constant that depends on Xi, . . . , Since only the first en- 
tries of S are non-zero, \4'i---k/2) has at most r non-zero entries, so the state can 
be compressed using logr qubit^f]. Bob send these qubits to Alice. Alice then 
computes U\<pi...k/2) an d measure that state. If Alice observes x^/2+i, ■■■ , £fc 
then she puts a 1 on the qubit channel, and otherwise she puts a 0. The proba- 
bility of Alice putting a 1 on the channel is 

\(xk/2+i,- ■ ■ ,x k \U\4> 1 ... k /2)\ 2 = |cx...,fe/2 1 2 \ {x k /2+i,- ■ .,x k \USV\xi, . . . ,x fc/2 )| 2 

= \ c l...,k/2\ 2 |(Zfc/2+l> ' • ' ,Xk\M\xi, . . . ,X k / 2 )\ 2 

= \ci... tk/2 \ 2 \M[ Xl ,...,x k }\ 2 
= \ c i...,k/2\ 2 |T[a;i,...,a; fe ]| 2 . 

Since T[xi, . . . , x k ] is non-zero if and only if f(xi, . . . , x k ) — 1, this probability 
will be positive if and only if f(xi, . . . , x k ) = 1. Thus, this is a nondctcrministic 
protocol with total cost logr + 1. 

Odd k: To use the protocol given in the even case, we add an extra degree of 
freedom to T. 

Lemma 17. If T is an order-k tensor with rank r then, there exists a tensor T' 
of order k + 1 with rank r where T[xi, . . . , x k ] = T'[xi, . . . , x k x k +i] for all x k +\ . 

By the above lemma we have that T'[xi, . . . , x k x k +i] — if and only if 
f(xi, . . . ,x k ) — for any given x k+ \. See Subsection IB . 3 1 for a proof. 

Before the protocol starts, each player knows T' (which has even order) and 
its matrization M'. We fix two players, Pi (Alice) and P k (Bob), and they can 
now use the protocol for even k. 

8 A n dimensional vector can be encoded as a quantum state with logn qubits by 
observing that a fc-qubit state is a 2 fc -dimensional vector. This fact was used by Raz 
to show an exponential separation between classical and quantum 2-party commu- 
nication [13] . 



B.3 Proof of Lemma Q7| 

Let T = Y^i=i \ v \) ' ' ' \ v k) ^ or some family of d-dimensional vectors. Define T' = 
Y^i=i \ v i) ' ' ' \ v k)\ v k+i) where each \v l k+1 ) is the all-1 vector. Thus, component- 
wise we have that 

r 

T[xi, ...,x k ] = ^v\{x l ) ■ ■•v\{x-k), 

i=l 

and 

r 

T'[xi, . . . ,x k x k+ i] = • ■■v k (x k )v k+1 (x k +i), 

i=l 

where v k+1 (xk+i) — 1 for all i and for all inputs x k +x- Then T'[x\, . . . , XkXk+i] = 
Yh=i v K x i) ■ ■■ v l( x k) and T'[xi, . . . ,x k x k+ i] = T[xi, ...,x k ] for any x k+1 . 



